[SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 #34724

rshkv · 2021-11-26T17:17:50Z

What changes were proposed in this pull request?

This raises Spark's minimum supported Pandas version to 1.0.0. If the installed version is below, Spark fails with: "Pandas >= 1.0.0 must be installed; however, your version was ..."

Why are the changes needed?

Some of the Pandas-on-Spark tests do not pass with Pandas < 1.0, see SPARK-37465.

Does this PR introduce any user-facing change?

Yes, users with installed Pandas versions below 1.0 will see failures. Also Pandas 1.0 introduces breaks (listed here) which should not affect Spark's interaction with Pandas, but it might introduce breaks to user environments that get their Pandas version transitively through PySpark.

How was this patch tested?

Existing tests. The Pandas version used in Github actions is 1.3.3. I'll verify tests also pass with 1.0.0.

AmplabJenkins · 2021-11-26T17:19:07Z

Can one of the admins verify this patch?

sarutak · 2021-11-26T19:24:39Z

It's already work in progress in #34717 isn't it?

Yikun · 2021-11-27T01:03:20Z

Yep, as @sarutak metioned, @rshkv would you mind help to review #34717?

rshkv · 2021-11-27T13:15:39Z

Ah didn't see, thanks guys. Will take a look @Yikun.

Raise Pandas minimum version to 1.0.0

3019e38

github-actions bot added CORE PYTHON SQL labels Nov 26, 2021

rshkv changed the title ~~[SPARK-37465][Python] Raise minimum supported Pandas version to 1.0.0~~ [SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 Nov 26, 2021

rshkv closed this Nov 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 #34724

[SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 #34724

Uh oh!

rshkv commented Nov 26, 2021 •

edited

Loading

Uh oh!

AmplabJenkins commented Nov 26, 2021

Uh oh!

sarutak commented Nov 26, 2021

Uh oh!

Yikun commented Nov 27, 2021

Uh oh!

rshkv commented Nov 27, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 #34724

[SPARK-37465][Python][WIP] Raise minimum supported Pandas version to 1.0.0 #34724

Uh oh!

Conversation

rshkv commented Nov 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

AmplabJenkins commented Nov 26, 2021

Uh oh!

sarutak commented Nov 26, 2021

Uh oh!

Yikun commented Nov 27, 2021

Uh oh!

rshkv commented Nov 27, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rshkv commented Nov 26, 2021 •

edited

Loading